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SUMMARY OF WORK ACCOMPLISHED 


During the years 1990-1993 our research project “Functional Statistical Data Analysis 
and Modeling” was unusually productive because our research on Functional Statistical 
Methods (based on the unifying idea of comparison density functions) culminated in, and 
was clarified bv, the development of a new research area which we call Change Analysis; 
it is an extension of Change-Point estimation which has an extensive literature. 

Change detection and estimation should emerge as a major research field at the leading 
edge of research in theoretical and applied statistics; evidence of this is the several major 
workshops in 1992-1993 (AMS-IMS-SIAM. Carleton University, University of Maryland) 
in which we actively participated, and in whose proceedings we have papers. 

Our research during 1990-1993 was reported in 13 published papers and one Ph.D. 
thesis listed below. We regard our research as concerned with (numbers in parentheses 
represent number of papers in the area): 

1. Change Analysis (4) 

2. Functional Inference (4) 

3. Time Series Analysis (2) 

4. Statistical Culture and History (3) 

5. Supervision of Ph.D. theses. 

Change analysis, and change-point estimation, extend standard statistical methods 

(which estimate the parameters of a probability model for a data set) by testing if there 
is a change in the parameters over the data, and modeling how probability distributions 
Eire changing. Our approach to change analysis extends tests for independence of two 
random variables X and Y ; in change problems Y is the observation and X is the index 
of observation. Our papers introduce the theory and practice of a new non-parametric 
approach called the Comparison Change approach, and a new parametric approach called 
Fisher Score Change Processes. An appendix outlines the new functions introduced in 
our theory, and our current research on applying change concepts to the presentation of 
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standard statistical methods and the development of “beyond AOV” techniques. 

Functional inference adds to standard statistical methods data analysis methods based • 

on quantile domain functions, information measures, and comparison density functions. 

Important results in our papers include: new goodness of fit tests based on entropy and 
comparison density functions; unification of statistical methods for continuous mid discrete • 

data enhanced by graphical methods inspired by change analysis; Change PP plot and 
continuous versions of sample quanntile functions. 

Time series analysis is still an extremely active and important interdisciplinary re- • 

search area. Recent developments are described in two recent major collections of papers 
which include papers by us. We continued to provide leadership in this field by organiz¬ 
ing the U.S. Japan Joint Seminar on Statistical Time Series Analysis, Honolulu, Hawaii. • 

January 24-29. 1993. 

Statistical culture and history is important (it should not be neglected by statisticians 
in an era where change, and continuous reviews of research directions to determine needed • 

changes or reforms, is increasingly normal). Our papers discuss: introductory probability 
in the era of the 1960's as reflected in textbooks by Mosteller; the career of Hirotugu 
Akaike (a role model for the conduct of interdiscplinary statistical research); the concept • 

of “hammers and nails ’ in the practice of statistics; proposals for improving the impact of 
statistics. 

In the area of supervision of Ph.D. theses, Cheng Cheng received his Ph.D. in Au- • 

gust 1993 with research in the areas of our research program (he is currently with Upjohn 
Laboratories in Kalamazoo, Michigan, as a post-doctoral fellow in molecular similarity). 

Cheng’s Ph.D. thesis, entitled On estimation of Quantile and Quantile Density Functions , • 

has 300+ pages and presents major results in Functional Statistical Inference. It is out¬ 
standing in scholarship (it provides a comprehensive review of many quantile estimators 
proposed in the literature) and original research (it proposes new estimators for practi- • 

cal estimation). In the pipe-line are two Ph. D. students (Todd Ogden and Ying-Sheng 
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Hu) who are currently doing reseasrch on change analysis estimation methods which 
function approximation by wavelets. 


APPENDIX 


A. Comparison Change Analysis Basic Identities Formal Derivation 

We believe that the relations between various comparison functions can best be 
derstood by formally deriving them from the conditional distribution identity 

Fxy(x.y) = J F Y \X=x'(y)' iF x( j: ')- 

The independence testing function D(t,u) = F\- y(Qx( T ), Qy( u )) has representation 

fQx(r) 

F X,y(Qx( t )'Qy( u )) = / F Y \x =x >{QY(u))dFx(x ) 

J —00 

Make change of variable t — Fy(i'), J = Qx( f )- Assume r .u exact. Then 

Fx,y(Qx( t )'Qy( u )) ~ F Y \x=Q x (t)(QY( u )) dt - 

A similar derivation shows 


Fx,y(Qx( t ), 



Fx\Y=Qy(W)(Qx( T ))du . 


Comparison functions 


D(t:u) ~ F Y \x-q x (i)(Qy( u )) 

D([0, r];u) = Fy-|A'<Qx(r)(£M u )) 

satisfy Change Analysis Identities: 

D(t,u) = tD((0,t];u) 

= t f d([0,T]\u)du' 

JO 

D(t,u) = j D(t\u)dt 

Jo 

= f dt j du'd(t,u) 

Jo Jo 


use 


un- 
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assuming in discrete case that r and u are "exact” values. 

Comparison densities satisfy 

d(t,u) = d(u;F y ,F y jx=(J xi r,) 

~ d ( t ’F\,F X f y-Q y ( u )) 

rd([0,r],u) = f d(t,u)dt 
JO 

rd( (0, t],u) = P[X < Qa'( t )I^ = 

~ F X\Y=Q y (u^Qx( t )) 

B. COMPARISON CHANGE AND QUANTILE ANALYSIS APPROACH 
TO UNIFICATION OF STATISTICAL METHODS 
AND BEYOND AOV TECHNIQUES 

Introduction 

This paper describes the new concepts of our research (Parzen (1979). (1989), (1992). 
(1993)) on a comprehensive development of change and quantile analysis function smooth¬ 
ing methods for statistical data analysis and tests of homegeneity of univariate, bivariate, 
and multi-sample data sets. The aim of the paper is to outline methods and techniques we 
have developed, summarize proposed graphs, discuss examples, and hopefully demonstrate 
that a theory extending statistical methods provides benefits of (1) Utility: providing new 
methods which may be break throughs, in a sense that they help solve problems untouched 
by standard methods, and (2) Beauty: enhance understanding of standard methods. 

Our research goal is to unify conventional and new statistical methodologies, such as 
those which Hirotsu (1993) calls “beyond analysis of variance (AOV) techniques”. We pro¬ 
pose a framework which integrates conventional (AOV and non-parametric) and emerging 
(beyond AOV) statistical techniques. 

The paper has several parts: I. Univariate quantile data analysis; II. Univariate 
Change-point analysis; III. Multisample data analysis. EXPLORE: an S-plus and Fortran 
library of computer programs for data analysis developed by Cheng Cheng is described in 
an appendix. Figures of functional statistics are all at the end of the paper. 
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ABSTRACT 


On Estimation of Quantile and Quantile Density Functions. (August 1993) 
Cheng Cheng, B.S., Beijing Computer Institute; 

M.S., University of Texas at El Paso 
Chair of Advisory Committee: Dr. Emanuel Parzen 

In statistical analysis the quantile function (qf) and the quantile density 
function (qdf) are equally important as the cdf and the pdf. Numerous smooth 
quantile function estimators have been proposed as alternatives to the sample 
quantile function. The research in this area has been somewhat esoteric in the 
sense that little work has been done in terms of providing guidelines for practice. 
This research is motivated by several practical issues involved in quantile function 
estimation. 

In the attempt to compare various qf estimators and to provide guidelines 
for their uses in practice, asymptotic behavior of translation and scale equivari- 
ant qf estimators and the derived qdf estimators are first investigated under a 
unified representation. It is shown that under moderate conditions these estima¬ 
tors share equally good asymptotic behavior. Some existing asymptotic results 
are strengthened and extended. Sufficient conditions for the consistency of the 
derived qdf estimators are obtained as well. 

The practical impact of the asymptotic results is the demonstration of the 
fact that smoothing the sample quantile function by any reasonable kernel always 
results in a qf estimator possessing nice asymptotic behavior. So asymptotic 
theory is not informative for obtaining guidelines for the selection of kernels in 
practice (finite samples). Finite-sample behavior of several translation and scale 
equivariant qf estimators is then formulated and compared according to several 
criteria. The finite sample results, in conjunction with the asymptotics, clearly 
suggest certain smoothing kernels to use in practice. 

A data-driven procedure to determine amount of smoothing is proposed. 
State-of-the-art demonstrations of the quantile-domain data analysis methodol¬ 
ogy are presented by analyzing several well-known data sets. 
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